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1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 

51 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA 

101 TTTGGATGAT AT AG AAAAT T TACTTCCTAA TACATTCAAC AAGTATAGCA 

151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA 

2 01 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT 
251 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 

3 01 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 
3 51 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA 
401 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 
451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 

5 01 GATCTCGATA CATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 
551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 

6 01 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
651 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 

7 01 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 
801 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 
851 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 
901 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT 
951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 

1001 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 

1051 AAAGAAGAAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC 

1101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT 

1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 

1201 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 

1251 TCACATTCAT AGATCGACCT TCATATATCC AATACGATGA TAAGGAAACA 

13 01 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 

1351 AAGAAATGGA GCCGAAATCT TAATCAAAAA GAATTGCGTC GATATTGCAA 

1401 AAGAATCGAA CTCTAAATCT TTCGTTAATA AGTATTACCA ATCTTGATTG 

1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGAT C ATTA AAGAAATAAA 

1501 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 

1551 CGATCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 

1601 ATACAAACCT TGGTCAAAAT ATTGAGGAAG GAAAAGAAGA CCAGTTAGCA 

1651 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA 

1701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 

1751 GGGGTTTTGG GG 
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1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT 

51 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA 

101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 

151 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA 

201 TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA TTATAAAGAT 

2 51 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG 

3 01 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT 

3 51 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 

4 01 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 

4 51 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 

5 01 CTCAAAAGCA GTATTTCTTT CAAGAC GAAT GGAACCAAGT TAGAG CAATG 
551 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA 

6 01 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG 

6 51 ATCATTTGAA AGT CAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 
701 GCAGACATGA ATGAACCTCG ATGTTGAT C A ACCTGCAAAT ACAATGTCAA 

7 51 GAATGAGAAA GATCACTTTC T CAACAAC AT CAACGTGCCG AATTGGAATA 
801 ATATGAAATC AAGAAC CAGA ATATTTTATT GCACTCATTT TAATAGAAAT 
851 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
951 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG 

10 01 CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA CAAAATCTTG 

1051 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 

1101 TAAATAAAAC TAGAGAAGAA AAGT CGAAGT ACTATGAAGA GCTGTTTAGC 

1151 TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AATTTTTCTA 

12 01 CAATATACTC CCCAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC 

1251 AAAAGAAAGT TAAGAAATAT GTGGAAC TAA ACAAGCATGA ACTCATTCAC 

1301 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CAT GGATG C A 

1351 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 

14 01 ACGTCTTATG GAAATTGCTC CGATGGATAT TCGAGGATCT CGTCGTCTCG 

14 51 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA 

1501 AACCTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA 

1551 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 

1601 GAAGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT 

1651 AC CGAAGAAA ACTACTTTCC GTCCAATTAT GACTTTCAAT AAGAAGATTG 

1701 TAAATT CAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG 

1751 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATC C 

1801 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 

1851 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 

1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAAC TAT C 

1951 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 

2 001 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC 

2 051 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 

2101 TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG 

2151 AACAAAATGA CTTAAATGCA AAGAAAACAT TAATTGTTGA AGCAAAGCAA 

2 201 AGAAATTATT TTAAGAAAGA TAACTTACTT CAAC CAGTCA TTAATATTTG 

2 251 CCAATATAAT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG 

23 01 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 

2 3 51 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 
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24 01 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 

24 51 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 

25 01 ATAAACGTAA GTCGTGAAAA TGGATT TAAA TTCAATATGA AGAAACTACA 

25 51 GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 
2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 

26 51 TCAATTGATA TGAAAACTCT TGCTTTAATG C CAAATATTA AC TTGAGAAT 

27 01 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT 
2751 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC 
2801 CATTATTTTA GAAAGACGAT TACAAC CGAA GACTTTGCGA ATAAAACTCT 
2851 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 

2 9 01 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGGAG TATGATCGAC 
29 51 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA 

3 0 01 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG 
3 0 51 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA 
3101 AAGTACATTT TCAACAGAGT TTGCATGATC CTCAAGGCAA AAGAAG C AAA 
3151 GCTAAAAAGT GACCAATGTC AATCTCTAAT TCAATATGAT GCATAGTCGA 
32 01 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC 
3251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 
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1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKD 

51 LEDIKIFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE L I DKCLVELL 

101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 

151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 

201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRN 

251 NQFFKKHEFV SNKNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM 

301 LEKVKDFNFN YYLTKSCPLP ENWRERKQKI ENLINKTREE KSKYYEELFS 

351 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH 

4 01 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLWS 

451 LIRCFFYVTE QQKSYSKTYY YRKNIWDVIM KMS I ADLKKE TLAEVQEKEV 

501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 

551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 

601 TMDIEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNNIVIDSKN 

651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAKQ 

701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 

751 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 

8 01 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI 

851 SIDMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 

901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 

951 LEVSKIIYSV TRAF FKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 

1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A 
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CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 

! + + + + + + 60 

GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTTAACTCCATCAAATCT 

PQNPKTPKPL*KKKKLR*FR 
PKTPKPQNPYKKRKN*GSLE- 
PKPQNPKTPIKKEKIEVV* K - 

AATAAAATATTATTCCCGCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT 

61 + + + + + + 120 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAACTAAACCTACTATATCTTTTAA 

NKILFPHKWRWILIWMI*KI 
I KYYSRTNGDGY * FG* YRKF 
* N I I PAQMEMD I D LDD I ENL- 

TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 

121 + + + + + + 180 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT 

YFLIHSTSIAALVVTRKDAK 
TS*YIQQV*QLL**QERMQN- 
LPNTFNKYSSSCSDKKGCKT- 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 

1 81 + + + + + + 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

a H*NLARNRLH*LFQSCKNN* 
b IEIWLEIAFIDYSKVAKTIR- 
LKSGSKSPSLTIPKLQKQLE- 

AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 

241 + + + + + + 300 

TCAAGATGAAGAGCCTACGTTTAGAAATATTGCTAAGAAAGAACTCTTTTAATCAAAATT 

a SSTSRMQIFITILS *EN* F* 

b VLLLGCKSL*RFFLEKI SFK- 

C FYFSDAWLYNDSFLRKLVLK- 

AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 

301 + + + + + + 360 

TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT 

a KAESKE*KLKHY*CLNKIR* 

b KRRAKSRN*NITNV* IKSGN- 

c SGEQRVE I ETLLMFK* NQVM- 

TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 

361 + + + + + + 420 

ACTCCTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT 

a *GLFYFLDHFLRSIMEKIT* 

b EDYSI F* ITS*GALWRKLLN- 

c RIILFFRSLLKEHYGENYLI- 



FIG. 12 



11/78 

TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT 

421 + + + + + + 480 

ATGATTTTCCATTTGTCAAACCTAATAAAGGGATCGGTTGTTACTACTCATATAATTTAA 

Y*KVNSLDYFPSQQ**VY*I 
TKR*TVWIISLANNDEYIKF- 
LKGKQFGLFP* PTMMS I LNS- 

CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 

481 + + + + + + 540 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 

HMRMSQRISIHQTYQRQTRY 
I*E*VKGSRYIRLTKDKLAI- 
YENESKDLDTSDLPKTNSL*- 



541 



AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 

TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 

a KTQEKV**SNSRRTYCIYYS 

b KRKKKFDNRTAEELIAFTIR- 

C NARKSLI IEQQKNLLHLLFV- 

TATGGGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 



600 



601 



660 



661 



721 



720 



780 



ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 

YGFYYNCFRYRR* T PE S * DN 
MGFITIVLGIDGELPSLETI ■ 
WVLLQLF*VSTVNSRVLRQL- 

TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT 

ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 

*KSCLQLKESQF*KF*CVCH 
EKAVYN*RNRSSESSDVYAI 
KKLFTTEGIAVLKVLMCMPL 

TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

ATAAAACACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT 

a YFVN*SQISYLNLMDSYRNK 

b IL*INLKYLISI*WIAIETN- 

C FCELISNILSQFNG*L* KQT- 

CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 

781 + + + + + + 840 

GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 

a PNKPCKFNGIYVKSFGTNAH 

b QINHASLMEYTLNPLGQMHT- 

c K*TMQV*WNIR*ILWDKCTL- 

TGAATTTATATTGGATTCTTAAAGCATAGATACACAGAATGCTTTAGAGACTGATTTAGC 

841 + + + + + + 900 

ACTTAAATATAACCTAAGAATTTCGTATCTATGTGTCTTACGAAATCTCTGACTAAATCG 

a * IYIGFLKHRYTECFRD* FS 

b EFILDS*SIDTQNALETDLA- 

C NLYWILKA* IHRML*RLI * L - 
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TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 

+ + + + + + ! 

AATGTTGTCTAATGGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 

LQQITCFDYSCSSLISLKEA 
YNRLPVLITLAHLLYL* KKQ 
TTDYLF*LLLLISYIFKRSR- 

GGCGAAATGAAAAGAAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 

a GEMKRRLKKEISKFVDSSVT 

b AK*KED*RKRFQNLLILL* P - 

C RNEKKTKERDFKIC*FFCNR- 

GGAATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC 

10 21 + + + + + + 1080 

CCTTAATTGTTGTTCTTATAATCGTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

a GINNKNISNEKEEELSQS* F 

b ELTTRILATKKKKSYHNPDS- 

c N*QQEY*QRKRRRAITILIL- 

TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 

10 8i + + + + + + H40 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 

a LKISKIPGKRDTFIKIHIL* 

b *RFQKFQVREIHSLKFIYYS- 

c KDFKNSR*ERYIH*NSYI IV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

1141 + + + + + + 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

a FFISQLLFSFILTIFFD*LE 

b FSFHSCYFLLS*QYFLISWK- 

c FHFTAVI FFYLNNI F * LAGS- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 

1201 + + + + + + 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

a VKSIK*EKR*TEVT*LIHIH 

b *KVSNKRSARLR*LSLFTFI- 

c KKYQIREALD*GNLAYSHS*- 

agatcgaccttcatatatccaatacgatgataaggaaacagcagtcatccgttttaaaaa 
1261 + g~~~tgctactattcctttgtcct^ 1320 

a RSTFIYPIR**GNSSHPF*K 

b DRPSYIQYDDKETAVIRFKN- 

c IDLHI SNTMIRKQQSSVLKI- 

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 

1321 + + + + + + 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 

a * CYED* IFRVKKWSRNLNQK 

b SAMRTKFLESRNGAEILIKK- 

C VL*GLNF* SQEMEP KS * SKR- 
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GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 

1381 + + + + + + 1440 

CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 

a ELRRYCKRIEL* IFR* *VLP 

b NCVDIAKESNS KS FVNKYYQ 

c IASILQKNRTLNLSLISITN- 



1441 



1501 



ATCTTGATTGATTGAAGAGATTGACGAGGCAACTGCACAGAAGATCATTAAAGAAATAAA 

TAGAACTAACTAACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 

ILIDCRD*RGNCTEDH*RNK 
S*LIEEIDEATAQKIIKEIK- 
LD*LKRLTRQLHRRSLKK*S 

GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 

+ + + + + + : 

CATTGAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAAGTT 



1500 



1560 



a VTFIN*RIN*ITNIEISDLQ 
LLLIRE*TKLLI*RSAIF" 



[it c NFY* LENKLNY*YRDQRSS I - 

TTGACGAAATAAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTGGTCAAAAT 

AACTGCTTTATTTTCGACTTGATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 

LTK*KLN*S*TIKNTNLGQN 
*RNKS* TKVRQ* KIQTLVKI 
DE I KAELKLDNKKYKPWS KY- 

ATTGAGGAAGGAAAAGAAGACCAGTTAGCAAAAGAAAAAATAAGGCAATAAATAAAATGA 

+ + + + + + : 

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCGTTATTTATTTTACT 



1561 



1621 



1620 



1680 



a IEEGKEDQLAKEKIRQ*IK* 

b LRKEKKTS*QKKK*GNK*NE- 

C *GRKRRPVSKRKNKAINKMS- 

GTACAGAAGTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT 

1681 + + + + + + 1740 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTAAATAACTTTTCTCCCCAA 

a VQK*RNKRFI FFNNLLKRGV 

b YRSEEIKDLFFSIIY*KEGF- 

C TEVKK*KIYFFQ* FIEKRGF- 

TTGGGGTTTTGGGGTTTTGGGG 

1741 + + 1762 

AACCCCAAAACCCCAAAACCCC 

a LGFWGFG 
b WGFGVLG 
C GVLGFW 
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2 EVDVQNQADNHGIHSALKTCEEIKEAKTLYSWIQKVIRCRNQSQSHYKDL 51 
|::: | :.::|: :| |.::| ==• ■ I I--U-I 

19 ELELEMQENQNDIQVRVK. . . . IDDPKQY . . LVNVTAACLLQEGSYYQDK 6 2 

52 EDIKIFAQTNIVATPRDYNEEDFKVIARKEVF . STGLMIELIDKCLVELL 100 
: .|.| ..|-. -1= -.:| I ...::|-: 

63 DERRYIITKALL. . . . EVAESDPEFICQLAVYIRNELYIRTTTNYIVAF . 10 7 

101 SSSDVSDRQKLQCFGFQLKGNQLAKTHLLTALSTQKQYFFQDEWNQVRAM 150 

108 CWHKNTQPFIEKYFNKAVLLPNDLLEVCEFAQVLYI 144 

151 IGNELFRHLYTKYLIFQRTSEGTLVQFCGNNVFDHLKVNDKFDKKQKGGA 200 

145 FDATEFKNLY LDRILSQDIRKELTFRKCLQRCVRSKF 181 

201 ADMNE. . .PRCCSTCKYNVKNEKDHFLNNINVPNWNNMKSRTRIFYCTHF 247 

.::|| .::|..: . |... ::|. -I ■ ■ \ ■ l- = 
182 SEFNEYQLGKYCTES . . QRKKTMFRYLSVTNKQKWDQTKKK 220 

248 NRNNQFFKKHE F VSNKNNI SAMDRAQT I FTNI FRFNRI RKKL KD KV I E KI 2 97 

| . : . . . | : . . : . : : | . : : : | : . • : I I I • I I I 

221 . RKENLLTKLQAIKESEDKSKRETG DIMNVEDAIKALKPAVMKKI 264 

298 AYMLEKVKDFNFNYYLTKSCPLPENWRERKQKIENLINKTREEKSKYYEE 34 7 

| . . : I :• I ••= M l-l =■ = 

265 AKRQNAMK KHMKAPKI PNSTLE SKYLTFKD 294 

348 LFSYTTDNKCVTQFINEFFYNILPKDFLTGRNRKNFQKKVKKYVELNKHE 397 

|: | .|.||.|.: .. : :| -I 

295 LIKFCHISEP KERVYKI LGKKYPKTEEEYKAAFGDS AS APFN . PE 33 8 

398 LIHKNLLLEKINTREISWMQVETSAKHFYYFDHENIYVLWKLLRWIFEDL 447 
| | . . : | . | = | | - : : - | - . - = I I = = 

339 LAGKRMKIEISKTWENELSAKGNTAEVWDNLISSNQLPYMAMLRNLSN. . 386 

448 WSLIRCFFYVTEQQKSYSKTYYYRKNIWDVIMKMSIADLKKETLAEVQE 497 

3 87 ILKAGVSD 394 

498 KEVEEWKKSLGFAPGKLRLIPKKTTFRPIMTFNKKIVNSDRKTTKLTTNT 54 7 

395 TTHS 398 

548 KLLNSHLMLKTLKNRMFKDPFGFAVFNYDDVMKKYEEFVCKWKQVGQPKL 597 
| :|- I • I : • - h 

399 IVINK ICEPKAVENSKM 415 

598 FFATMDIEKCYDSVNREKLSTFLKTTKLLSSDFWIMTAQILKRKNNIVID 647 

| :. .- :.|| | :.. :|..| •= = • = I I 1= l = = 

416 F. . PLQFFSAIEAVN . EAVTKGFKAKK . . . RENMNLKGQIEAVKE . .WE 457 

64 8 SKNFRKKEMKDYFRQKFQKIALEGGQYPTLFSVLENEQNDLNAKKTLIVE 697 

..: | |:| .||..: ..:..| |. .| 

458 KTDEEKKDM ELEQTEEGEFVKVNEGIGKQYINSIELAIK 496 

698 AKQRNYFKKDNLLQPVINICQYNYINFNGKFYKQTKGIPQGLCVSSILSS 747 

497 I AVNKNLDE I KGHTAI FS DVS GS MS TSMS GGAKKYGS VRTCLEC ALVLGL 546 

74 8 FYYATLEESSLGFLRDESMNPENPNVNLLMRLTDDYLLITTQENNAVLFI 7 97 

547 MVKQRCEKSSFYIFSSPSSQCNKCYLEVDL 576 

FIG. 13 
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798 EKLINVSRENGFKFNMKK.LQTSFPLSPSKFAKYGMDSVEEQNIVQDYCD 846 

-: = ::= -U II-- -h- = = = = 1 I 

577 PGDELRPSMQKLLQEKGKLGGG . . TDFPYECIDEWTKNKTHVD 617 

84 7 WIGISIDMKTLALMPNINLRIEGILCTLNLNMQTKKASMWLKKKLKSFLM 896 

l-l II-. - - = lhl - = h - II- I- = 

618 NIVIL.SDMMIAEGYSDINVRGSSIVNSI KKYKDEVN 653 

897 NNITHYFRKTITTEDFANKTLNKLFISGGYKYMQCAKEYKD.HFKKNLAM 945 

1|. : .:• |::: |::. :.|:.: :: | ::| 

654 PNIKIF. . .AVDLEGYG KCLNLGD E FNENNY I K I FGM 687 

946 SSMIDIiEVSKIIYSVTRAFFKYLVCNIKDTIFGEEHYPDFFLSTLKHFIE 995 

I • I :|:: ... :. :::| 
688 SDSI LKFI S AKQGGA NMVE 706 

996 I FSTKKY I FNRVC 1008 

707 VI. .KNFALQKIG 717 

FIG. 13 

(CONTINUED) 



132 LSTQKQYFFQDEWNQVRAMIGNEL . FRHLYTKYLIFQRTSE . . GTLVQFC 17 8 
:| ..| ....| MM : :. .. | | | . 

1 MSRRNQ KKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQI 43 

179 GNNVFDHLKVNDKFDKKQKGGAADMNEPRCCSTCKYNVKNEKDHFLNNIN 228 
:: :. ||..:. :. ..|...| :| | .|..:.:|..:| 

44 KEEDLKLLKFKNQDQDGNSGNDDDDEE NNSNKQQELLRRVN 84 

229 VPNWNNMKSRTRIFYCTHFNRJSTNQFFKKHEFVSNKNNISAMDRAQTIFTN 278 
|::|| |:.| : . .| 

85 QIKQQVQLIKK. . . VGSKVEKDLNLNEDENKKN 114 

279 IFRFNRIRKKLKDKVIEKIAYMLEKVKDFNFNYYLTKSCPLPENWRERKQ 328 

■■■ ■■■■■■ I- • = I I--I-- • •••II • 

115 GLSEQQVKEEQLRTITEEQVKYQNLVFNMDYQLDLNESGGHRRHRRETDY 164 

329 KIENLINKTREEKSKYYEELFSYTTDNKCVTQFINE. FFYNILPKDFLTG 3 77 

..|.: | . : | . : | : | : || :: . 

165 DTEKWFEISHDQK NYVS I YANQKTSYCWWLKDYFNK 200 

3 78 RNRKNFQKKVKKYVELNKHELIHKNLLLEKINTREISWMQVETSAKHFYY 427 

201 NNYDHLNVSINRLE. . TEAEFYAFDDFSQTIKLTNNSYQTVNID 242 

428 FDHENIYVLWKLLRWI . . FEDLWSLIRCFFYVTEQQKSYSKTYYYRKNI 475 

.: :| |||:: :| : : ||: : | .| .:.|. . ..| 

243 VNFDNNLCILALLRFLLSLERFNILNIRSSY. . TRNQYNFEKIGELLETI 290 

4 76 WDVIMKMSIADLKKETLAEVQEKEVEEWKKSLGFAPGKLRLIPKKTTFRP 525 

:.|:. . .:|: .:: ..:.: ... . |. |.. 

291 FAWFSHR HLQGIHLQVPCEAFQYLVNSSSQISVKDSQLQ 330 

526 IMTFNKKIVNSDRKTTKLTTNTKLLNSHLMLKTLKNRMFKDPFGFAVFNY 575 

: |...: | |.|: . |:|.. .|. :. . . || |. 

331 VYSFSTDLKLVD . . TNKVQDYFKFLQEFPRLTHVSQQAIPVSATNAVESJL 3 78 



FIG. 14 
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576 DDVMKKYEEFVCKWKQVGQPKLF FATMDIEKCYDS . . VNREK 615 

: .:|| :. | |. | :. :.:| :.::| 

3 79 WVLLKKVKH ANLNLVSIPTQFNFDFYFVNLQHLKLEFGLEPNILTKQK 426 

516 LSTFL KTTKLLSSDFWIMTAQILKRKNNI . . VIDSKNFRKKEMK 657 

|...| |. |:| |:. || ||. : ... ||::... . 

427 LENLLL S I KQS KNL KFLRLNFYT YVAQETS RKQ I LKQ ATT I KNLKNNKNQ 476 

558 DYFRQKFQKIALEGGQYPTLFSVLEN . . EQNDLNAKKTLIVEAKQRNYFK 705 

- • ■ h II I 'I:--. ::|-l I 

477 EETPETKDETPSESTSGMKFFDHLSELTELEDFSVN. . . .LQATQEIY. . 520 

706 KDNLLQPVINICQYNYINFNGKFYKQTKGIPQGLCVSSILSSFYYATLEE 755 

II I -I :.:... | |.||: 
521 .DSLHKLLIRSTNIiKKFKLSYKYEMEKSKMDTFIDLKNI YETLNN 564 

756 SSLGFLRDESMNPENPNVNLLMRLTDDYLLITTQENNAVLFIEKLINVSR 3 05 

|: | | .||: | ||: .:..:: [|. 

565 LKRCSVNISNPHGNISYELTN KDSTFYKFKLTLNQE 500 

806 ENGFKFNMKKLQTSFPLSPSKFAKYGMDSVEEQNIVQDYCDWIGISIDMK 855 

|:.:| |..|.:. | ||.: .|:|. : ::..|..|: : ::. 
601 LQHAKYTFK. . QNEFQFNNVKSAKIESSSLESLEDIDSLCKSIASCKNLQ 648 

856 TLALMPNINLRIEGILCTLNLNMQT. . KKASMWLKK. . KLKSFLMNNITH 901 

| ::|:. |:.... |.. :::|. .||.: .|. 
64 9 NVNI IAS LLYPNNI QKNP FNKPNLLFFKQFEQLKNLENVS INC 6 91 

902 YFRKTI . . . TTEDFANKTLNKLFISGGYKYMQCAKEYKDHFKKNLAMSSM 948 

.: . | | | || | .:|: :|.. ||. .:..: 

692 ILDQHILNSISEFLEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQLPEL 741 

949 IDLEVSKIIYSVT RAFFKYLVCNIKDT . . IFGEEHY 982 

| :||:. |. ||:. .:. .: 

742 NQVYINQQLEELTVSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDF 791 

983 PDFFLS TLKHFIEIFSTKKY IFNRVCMILKAKEAKLKSDQCQSLIQ 1028 

.: :| .| | :|..|| . |: ... | ..::.|.|:. 

792 DQNTVSDDSIKKILESISESKYHHYLRLNPSQSSSLIKSENEEIQELLK 840 

FIG. 14 
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4 DIDLDDIENLLPNTFNKYSSSCSDKKGCKTLKSGSKSPSLTIPK 4 7 

::. ..||. :.::... | |.. :||-l---- =ll -I- 

617 NVKSAKIESSSLESLEDIDSLCKSIASCKNLQNVNI IASLLYPNNIQKNP 666 

4 8 LQKQLEFYFSDANLYNDSFLRKLVIjKSGEQRVE. . . . IETLLM 8 6 

: || = l • • -|: ::s| =1-1 •= =1 I--- 

667 FNKPNLLF FKQFEQLKNLENVS INCI LDQHI LNS I SEFLEKNKKI KAF I L 716 

FIG. 15 
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1 MEMDIDLDDIENL LPNTFNKYSSSCSDKKGCKTLKSGSKSPS . . . 42 

|: |. . .|| ...|.. |:| |...:. . | || ..: 

4 91 IELAIKIAVNKNLDEIKGHTAIFSDVSGSMSTSMSGGAKKYGSVRTCLEC 540 

43 . LTIPKLQKQ LEFYFSDANLYNDSFLRKLVLKSGEQRVEIETLL 85 

|.: : M : :: |... :|.::| .: |.::| |..::.|| 

541 ALVLGLMVKQRCEKSSFYIFSSPSSQCNKCYL . EVDLPGDELRPSMQKLL 589 

FIG. 16 



telomerase p43 LQK 2 L E F YE SDA LYN D S FI RKLVLKSGEQR V E I ETLLM 

human La ICH2CJE YYFGDFNLPRDKFIKEQI . KLDEGWVPLEIMIK 

Xenopus LaA ICE2IE YYEGDHKLPRDKFLKQQI . LLDDGWVPLETMIK 

Drosophila La ilr; VEYYEGDAKLNRDKFIREQIGKNEDGWVPLSVLVT 

S. C. Lhplp CLKgV§F|YJSEFyFPYgRyRTTAEK.NDGwjv|piSTIAT 
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tagcagttaa 
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gtctcgagtg 
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gattgataaa 
aatggaaatt 
tgaattagag 
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gtacttcaac 
ggttctctat 
ataagatatt 
gttttctgaa 
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aagcagtaaa 
gaaaacaact 
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ctcgtgaacg 
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attgtagcat 
aaagcagtac 
atttttgatg 
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cgttacctct 
aatctcttaa 
ggagacataa 
atagccaaga 
accttggaat 
cctaaagaaa 
aaagcagcct 
atgaagattg 
gaggtttggg 
ttgtctaaca 
atttgtgagc 
attgaagctg 
aatcttaaag 
aaagatatgg 
ggcaagcaat 
gatgaaatca 
atgtcaggtg 
cttggtttga 
agttctcaat 
tctatgtaaa 
gagtgcattg 
gatatgatga 
agcatcaaaa 
gaaggttacg 
atattcggta 
atggtcgaag 
gattcttcta 
atttaaatta 
gcctattcaa 



taaaacccaa 
aagcacaata 
accaaaatga 
tcactgcagc 
atatcatcac 
tggcagtcta 
tttgtgttgt 
ttttgcctaa 
caactgaatt 
tcactttccg 
actaacttgg 
cagttaccaa 
ccaaacttta 
tgaacgttga 
gatagaatgc 
caaagtactt 
gagtctataa 
ttggtgattc 
aaatctctaa 
ataatttaat 
tcttaaaagc 
ccaaggccgt 
ttaatgaagc 
gtcaaatcga 
agttggagta 
acattaactc 
aaggacacac 
gagccaagaa 
tggtaaaata 
gcaataagtg 
aacttttgca 
atgaatggac 
ttgcagaagg 
agtacaagga 
gaaagtgcct 
tgagcgattc 
ttatcaaaaa 



ttatctattt 
atgattctgc 
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MEI ENNQAQQPKAEKLWWELELEMQENQND I QVRVKI DDPKQYL 
VOTTAACLLQEGSYYQDKDERRYIITKALLEVAESDPEFICQLA 
VYIRNELYIRTTTNYIVAFCWHKNTQPFIEKYFNKAVLLPNDL 
LEVCEFAQVLYIFDATEFKNLYLDRILSQDIRKELTFRKCLQRC 
VRSKFSEFNEYQLGKYCTESQRKKTMFRYLSVTNKQKWDQTKKK 
RKENLLTKLQAIKESEDKSKRETGDIMNVEDAIKALKPAVMKKI 
AKRQNAMKKHMKAPKIPNSTLESKYLTFKDLIKFCHISEPKERV 
YKILGKKYPKTEEEYKAAFGDSASAPFNPELAGKRMKIEISKTW 
ENELSAKGNTAEVWDNLISSNQLPYMAMLRNLSNILKAGVSDTT 
HSIVINKICEPKAVENSKMFPLQFFSAIEAVNEAVTKGFKAKKR 
ENMNLKGQIEAVKEWEKTDEEKKDMELEQTEEGEFVKVNEGIG 
KQYINSIELAIKIAVNKNLDEIKGHTAIFSDVSGSMSTSMSGGA 
KKYGS VRTCLECALVLGLMVKQRCEKS SFYIFSSPSSQCNKCYL 
EVDLPGDELRPSMQKLLQEKGKLGGGTDFPYECIDEWTKNKTHV 
DNI VI LSDMMI AEGYSD INVRGS S IVNS I KKYKDEVN PN I KI FA 
VDLEGYGKCLNLGDEFNENNYIKIFGMSDSILKFISAKQGGANM 
VEVIKNFALQKIGQK 

FIG. 20 

MSRRNQKKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQIK 
EEDLKLLKFKNQDQDGNSGNDDDDEENNSNKQQELLRRVNQIKQ 
QVQLIKKVGSKVEKDLNLNEDENKKNGLSEQQVKEEQLRTITEE 
QVKYQNLVFNMDYQLDLNESGGHRRHRRETDYDTEKWFEISHDQ 
KNYVS I YANQKTS YCWWLKDYFNKNNYDHLNVS INRLETEAEFY 
AFDDFSQTIKLTNNSYQTVNIDVNFDNNLCILALLRFLLSLERF 
NILNIRSSYTRNQYNFEKIGELLETIFAWFSHRHLQGIHLQVP 
CEAFQYLVNSSSQ I S VKDS QLQVYS FSTDLKLVDTNKVQDYFKF 
LQEFPRLTHVSQQAI PVSATNAVENLNVLLKKVKHANLNLVS I P 
TQFNFDFYFVNLQHLKLEFGLEPNI LTKQKLENLLLS I KQS KNL 
KFLRLNFYTYVAQETSRKQILKQATTIKNLKNNKNQEETPETKD 
ETPSESTSGMKFFDHLSELTELEDFSVNLQATQEIYDSLHKLLI 
RSTNLKKFKLSYKYEMEKSKMDTFIDLKNIYETLNNLKRCSVNI 
SNPHGNISYELTNKDSTFYKFKLTLNQELQHAKYTFKQNEFQFN 
NVKSAKIESSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPN 
NIQKNPFNKPNLLFFKQFEQLKNLENVS INC ILDQHI LNS ISEF 
LEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQLPELNQVYINQ 
QLEELTVSEVHKQ WENHKQKAF YE PLCE F I KE S S QTLQL I D FD 
QNTVSDDSIKKILESISESKYHHYLRLNPSQSSSLIKSENEEIQ 
ELLKACDEKGVLVKAYYKFPLCLPTGTYYDYNSDRW 

FIG. 22 



MKILFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 
PNSRKIALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYKI 
ARNEDVNNSLFCHSANVNVTLLKGAAWKMFHSLVGTYAFVDLLI 
NYT VI QFNGQ F FTQ I VGNRCNE PHL PP KWVQRS S S S SATAAQ I K 
QLTEPVTNKQFLHKLNINSSSFFPYSKTLPSSSSIKKLTDLREA 
I FPTNLVKI PQRLKVRINLTLQKLLKRHKRLNYVS ILNS ICPPL 
EGTVLDLSHLSRQS P KERVLKFI I VI LQKLLPQEMFGS KKNKGK 
IIKNLNLLLSLPLNGYLPFDSLLKKLRLKDFRWLFISDIWFTKH 
NFENLNQLAICFISWLFRQLIPKIIQTFFYCTEISSTVTIVYFR 
HDTWNKL I TP F I VE Y FKTYLVENNVCRNHNS YTLSNFNHS KMRI 
IPKKSNNEFRIIAIPCRGADEEEFTIYKENHKNAIQPTQKILEY 
LRNKRPTSFTKIYSPTQIADRIKEFKQRLLKKFNNVLPELYFMK 
FDVKSCYDSIPRMECMRILKDALKNENGFFVRSQYFFNTNTGVL 
KLFNWNASRVPKPYELYIDNVRTVHLSNQDVINWEME I FKTA 
LWVEDKCYIREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS 
QDTLI LKLADD FLU STDQQQVI N I KKLAMGGFQKYNAKANRDK 
ILAVSSQSDDDTVIQFCAMHIFVKELEVWKHSSTMNNFHIRSKS 
S KG I FRS L I AL FNTRIS YKTI DTNLNS TNTVLMQ I DHVVKNI S E 
CYKSAFKDLS INVTQNMQFHS FLQRI I EMTVSGCP ITKCDPLI E 
YEVRFTILNGFLESLSSNTSKFKDNIILLRKEIQHLQAYIYIYI 
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1 tcaatactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 

61 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 

121 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 

181 ggaggatctc aagcttttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 

241 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 

301 gattaagtag caagtttaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 

3 61 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta 

421 attaagaacg attactgaag aataggttaa gtattaaaat ttagtattta acatggacta 

481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 

541 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 

601 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 

661 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 

721 ttcacaaaca atcaaactta ctaataattc ttactagact gttaacatag acgttaattt 

781 tgataataat ctctgtatac tcgcattgct tagattttta ttatcactag aaagattcaa 

841 tattttgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 

901 gctacttgaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 

961 acaagttcct tgcgaagcgt tctaatattt agttaactcc tcatcataaa ttagcgttaa 

1021 agatagctaa ttataggtat actctttctc tacagactta aaattagttg acactaacaa 

1081 agtccaagat tattttaagt tcttataaga attccctcgt ttgactcatg taagctagta 

1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 

1201 caagcatgct aatcttaatt tagtttctat ccctacctaa ttcaattttg atttctactt 

1261 tgttaattta taacatttga aattagagtt tggattagaa ccaaatattt tgacaaaaca 

1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 

1381 aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 

1441 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 

1501 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 

1561 gcttgaagat ttcagcgtta acttgtaagc tacccaagaa atttatgata gcttgcacaa 

1621 acttttgatt agatcaacaa atttaaagaa gttcaaatta agttacaaat atgaaatgga 

1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 

1741 taaaagatgc tctgttaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa 

1801 taaagattct actttttata aatttaagct gaccttaaac taagaattat aacacgctaa 

1861 gtatactttt aagtagaacg aattttaatt taataacgtt aaaagtgcaa aaattgaatc 

1921 ttcctcatta gaaagcttag aagatattga tagtctttgc aaatctattg cttcttgtaa 

1981 aaatttacaa aatgttaata ttatcgccag tttgctctat cccaacaata tttagaaaaa 

2041 tcctttcaat aagcccaatc ttctattttt caagcaattt gaataattga aaaatttgga 

2101 aaatgtatct atcaactgta ttcttgatca gcatatactt aattctattt cagaattctt 

2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 

2221 tcttgattat actaaattat ttaaaacact tcaatagtta cctgaattaa attaagttta 

2281 cattaattag caattagaag aattgactgt gagtgaagta cataagtaag tatgggaaaa 

2341 ccacaagcaa aaagctttct atgaaccatt atgtgagttt atcaaagaat catcctaaac 

2401 cctttagcta atagattttg accaaaacac tgtaagtgat gactctatta aaaagatttt 

2461 agaatctata tctgagtcta agtatcatca ttatttgaga ttgaacccta gttaatctag 

2521 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 

2581 aggtgtttta gtaaaagcat actataaatt ccctctatgt ttaccaactg gtacttatta 

2641 cgattacaat tcagatagat ggtgattaat taaatattag tttaaataaa tattaaatat 

2701 tgaatatttc tttgcttatt atttgaataa tacatacaat agtcattttt agtgttttga 

2761 atatatttta gttatttaat tcattatttt aagtaaataa ttatttttca atcatttttt 

2 821 aaaaaatcg 
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Oxyt r i cha LCVS YI LS S FYYANLEENALQFLRKESMDPEKPETNLLMRLT 

Euplotes LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 

FIG. 24 



ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 
GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 
TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 
TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 
ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 
TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 
GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 
TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 
GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 
ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 
TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 
GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 
TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 
GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 
ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 
AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 
AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 
GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 
GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 

TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 
CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 
AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 
AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 
CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 
TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 
TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 
CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 
TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 
TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 
TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 
GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 
TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 
CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 
AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 
TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 
TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 
CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 
AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 
CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 
TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 
CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 
TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 
ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 
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AKFLHWLMSVYWELLRS FF YVTETTFQKNRLFFYRKSVWS KLQSIGI RQHLKR 
VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIWMDYVVGARTFRREK 
RAERLTSRVKALFSVLNYERA 
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GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 
TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 
GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAAT C AGACAG CACTTG AAG 
AGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGC 
CAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACGGGC 
TGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGA 
GAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCT 
CAACTACGAGCGGGCGCG 
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MTEHHTPKSRILRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTWGF 
DSKPDEGVQFSSPKCSQSELIANWKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLVSTF 
PNYLISILESKNWQLLLEIIGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKNNVFEETVSKKRKR 
TIETSITQNKSARKEVSWNSISISRFSIFYRSSYKKFKQDLYFNLHSICDRNTVHMWLQWIFPRQFG 
LINAFQVKQLHKVI PLVSQSTWPKRLLKVYPL IEQTAKRLHRI S LS KVYNHYCPYI DTHDDEKILS 
YSLKPNQVFAFLRSILVRVFPKLIWGNQRIFEI ILKDLETFLKLSRYESFSLHYLMSNIKISEIEWL 
VLGKRSNAKMCLSDFEKRKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIWKLLCR 
PFI TSMKMEAFEKINENNVRMDTQKTTLP PAVI RLLP KKNTFRLI TNLRKRFL I KMGSNKKMLVSTN 
QTLRPVASILKHLINEESSGIPFNLEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLM 
FRIVKKKLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKWQLLSMKTSDTLFVDFVDY 
WTKSS SEI FKMLKEHLSGHI VKI GNSQYLQKVG I PQGS I LS S FLCHF YMEDLI DEYLS FTKKKGS VL 
LRWDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENSNGIINNTFFNESKKRMPFFG 
FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCN 
IYRLGYSMCMRAQAYLKRMKDIFIPQRMFITDLLNVIGRKIWKKLAEILGYTSRRFLSSAEVKWLFC 
LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRIAD 
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S-l: FFY VTE TTF QKN RLF FYR KSV WSK 

S-2: RQH LKR VQL RDV SEA EVR QHR EA 

S-3: ART FRR EKR AER LTS RVK ALF SVL NYE 



A-l: AKF LHW 
A- 2 : LFF YRK 
A- 3 : PAL LTS 



LMS VYV VEL LRS 
SVW SKL QSI GIR 
RLR FIP KPD GLR 

FIG. 32 



FFY VTE TTF Q 
QHL KRV QLR DVS 
PIV NMD YW 



ta a g c c teg 
cag acc aaa gga att cca taa gg 
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5(C) 
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Genomic DNA Insert 



5825 bp Sequenced 
2 kb Hind ill Fragment 



FIG. 33A 



RT Motifs 12 3(A) 4(B) 5(C)6(C) 

n I -alffi alt 3 m imwws 




4-1 cDNA 
2-3 & 5-20 cDNA 



5' RT-PCR W/M2-B14 
5' RT-PCR W/M2-B15 Band A 
5" RT-PCR W/M2-B15 Band B 
5' RT-PCR W/M2-B16 Band C 



FIG. 33B 
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Wild Type Telomerase Gene 
Wild Type Telomerase Gene 



1 . Transform with linear fragment 
containing the telomerase gene 
disrupting with a LEU2 or ura4 marker 



Telomerase Gene::M 



Wild Type Telomerase Gene 




2. Assay in selective media 



3. Sporulate, and grow on 
selective media 



Wild Type Telomerase Gene 




Wild Type (X2) 



Senescence? (X2) 



(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 



FIG. 43 
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FIG. 45 
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met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 

10 20 
arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gin ser ile 
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 
gly ile arg gin his leu lys arg val gin leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60 

glu ala glu val arg gin his arg glu ala arg pro ala leu leu 
GAA GGA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 HO 

glu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gin asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

160 170 

phe val lys val asp val thr gly ala tyr asp thr ile pro gin 
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 

180 

asp arg leu thr glu val ile ala ser ile ile lys pro gin asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 

190 200 

thr tyr cys val arg arg tyr ala val val gin lys ala ala met 
ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC ATG 



FIG. 47 
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210 

gly thr ser ala arg pro ser arg ala thr ser tyr val gin cys 
GGC ACG TCC GCA AGG CCT TCA AGA GCC ACG TCC TAC GTC CAG TGC 

220 230 

gin gly ile pro gin gly ser ile leu ser thr leu leu cys ser 
CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC 

240 

leu cys tyr gly asp met glu asn lys leu phe ala gly ile arg 
CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG ATT CGG 

250 260 

arg asp gly leu leu leu arg leu val asp asp phe leu leu val 
CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG TTG GTG 

270 

thr pro his leu thr his ala lys thr phe leu arg thr leu val 
ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC CTG GTC 

280 290 

arg gly val pro glu tyr gly cys val val asn leu arg lys thr 
CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG AAG ACA 

300 

val val asn phe pro val glu asp glu ala leu gly gly thr ala 
GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC ACG GCT 

310 320 

phe val gin met pro ala his gly leu phe pro trp cys gly leu 
TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC GGC CTG 

330 

leu leu asp thr arg thr leu glu val gin ser asp tyr ser ser 
CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC TCC AGC 

340 350 

tyr ala arg thr ser ile arg ala ser leu thr phe asn arg gly 
TAT GCC CGG ACC TCC ATC AGA GCC AGT CTC ACC TTC AAC CGC GGC 

360 

phe lys ala gly arg asn met arg arg lys leu phe gly val leu 
TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG GTC TTG 

370 380 

arg leu lys cys his ser leu phe leu asp leu gin val asn ser 
CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG AAC AGC 

390 

leu gin thr val cys thr asn ile tyr lys ile leu leu leu gin 
CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG 

400 410 

ala tyr arg phe his ala cys val leu gin leu pro phe his gin 
GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG 



FIG. 47 

(CONTINUED) 
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420 

gin val trp lys asn pro his phe ser cys ala ser ser leu thr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 

arg leu pro leu leu leu his pro glu ser gin glu arg arg asp 
CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG GAT 

450 

val ala gly gly gin gly arg arg arg pro ser ala leu arg gly 
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

460 470 

arg ala val ala val pro pro ser ile pro ala gin ala asp ser 
CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 

thr pro cys his leu arg ala thr pro gly val thr gin asp ser 
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 

pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 

CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 
510 

pro gly gly arg ser gin pro gly thr ala leu arg leu gin asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gin pro gly arg glu gin 
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gin gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arg gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC « 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 

CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 

AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 

CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 

TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 

AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 

GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 

TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Motif -1 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



. LWSLIRCFFYVTEQQKSYSKT . . . 
. FIIPILQSFFYITESSDLRNRT . . . 
.LIPKIIQTFFYCTEISSTVTIV. . . 
. YWELLRSFFYVTETTFQKNRL . . . 
FFY TE 



Motif 0 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



Motif A 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 

Motif B 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 

consensus 



p hhh K hR h R 

. KSLGFAPGKLRLI PKKT - - TFRPIMTFNKKI V . 
. QKTTLPPAVIRLLPKKN- -TFRLITNLRKRFL . 
.TLSNFNHSKMRIIPKKSNNEFRIIAIPCRGAD . 
. ARPALLTSRLRFIPKPD- -GLRPIVNMDYWG . 



R PK 



AF 



R I 



h hDh 

. PKLFFATMDIEKCYDSVNREKLSTFLK. . . 
. RKKYFVRIDIKSCYDRIKQDLMFRIVK . . . 
.PELYFMKFDVKSCYDSIPRMECMRILK. . . 
. PELYFVKVDVTGAYDTI PQDRLTEVIA . . . // . 
F D YD 

hPQG pS hh 
.NGKFYKQTKGIPQGLCVSSILSSFYYA. . . 
. GNSQYLQKVGIPQGSILSSFLCHFYME . . . 
. EDKCYIREDGLFQGSSLSAPIVDLVYD . . . 
. RATSYVQCQGIPQGSILSTLLCSLCYG . . . 
G QG S 



Motif C 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 

Motif D 
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h F DD hhh 
. PNVNLLMRLTDDYLLITTQENN . 
. KKGSVLLRWDDFLFITVNKKD . 
. SQDTLILKLADDFLIISTDQQQ . 
. RRDGLLLRL VDD FLL VT PHLTH . 
DD L 

Gh h cK 
. NVSRENGFKFNMKKL . . . 
. LNLSLRGFEKHNFST . . . 
.KKLAMGGFQKYNAKA. . . 
. LRTLVRGVPEYGCW . . . 
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1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 

51 CCGCGATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 

101 AGCCACTACC GCGAGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 

151 GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 

2 01 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC C CTGGGANGN ANGGCNGCCC 

2 51 CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 

3 01 CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT 

3 51 TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC 

4 01 ACCACCAGCG TGCGCAGCTA CCTGCCCAAC ACGGTGACCG ACGCACTGCG 
451 GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC 
501 TGGTTCACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 
551 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 
601 TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC 
651 CAACGGGCCT GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 
701 CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 
751 GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 
801 CCGTTGGGCA GGGGTCCTGG GCCCACCCGG GCAGGACGCC TGGACCGAGT 
851 GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGCCAC 
901 CTCTTTGGAG GGTGCGCTCT CTGGCACGCG CCACTCCCAC CCATCCGTGG 
951 GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT 

1001 GGGACACGCC TTGTCCCCCG GTGTACGCCG AGACCAAGCA CTTCCTCTAC 

1051 TCCTCAGGCG ACAAGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG 

1101 AGGCCCAGCC TGACTGGCGT TCGGGAGGTT CGTGGAGACA NTCTTTCTGG 

1151 TTCCAGGCCT TGGATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCCA 

12 01 GCGNTACTGG CAAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG 
1251 CGCAGTGCCC CTACGGGGTG TTCCTCAAGA CGCACTGCCC GCTGCGAGCT 

13 01 GCGGTCACCC CAGCAGC CGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 
1351 TGTGGCGGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG 

14 01 CTGCTCCGCC AGCACAGCAG CCCCTGGCAG GTGTACGGCT TCGTGCGGGC 
1451 CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG 
1501 AACGCCGCTT CCTCAGGAAC AC CAAGAAGT TCATCTCCCT GGGGAAG CAT 
1551 GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG 
1601 CGCTTGGCTG CGCAGGAGCC CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC 
1651 ACCGTCTGCG TGAGGAGATC CTGGC CAAGT TCCTGCACTG GCTGATGAGT 
17 01 GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGGAGAC CAC 
1751 GTTT CAAAAG AACAGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 
1801 TGCAAAGCAT TGGAATCAGA CAGCACTTGA AGAGGGTGCA GCTGCGGGAG 
1851 C TGT CGGAAG CAGAGGTCAG GCAGCATCGG GAAGCCAGGC CCGCCCTGCT 
1901 GACGTCCAGA CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCGATTG 
1951 TGAACATGGA CTACGTCGTG GGAGC CAGAA CGTTCCGCAG AGAAAAGAGG 
2 001 GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG TGCTCAACTA 
2 051 CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 
2101 ACGATATCCA CAGGGCCTGG CGCACCTTCG TGCTGCGTGT GCGGGCCCAG 
2151 GACCCGCCGC CTGAGCTGTA CTTTGTCAAG GTGGATGTGA CGGGCGCGTA 

22 01 CGACACCATC CCCCAGGACA GGCTCACGGA GGTCATCGCC AGCATCATCA 
2 2 51 AAC C C CAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT CCAGAAGGCC 

23 01 GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT CTACCTTGAC 
2351 AGACCTCCAG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGANAACA 

24 01 GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG 
2451 GCCAGCAGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACCACGC 
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25 01 CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGC CAGGGG ATCCCGCAGG 
2551 GCTCCATCCT CTCCACGCTG CTCTGCAGCC TGTGCTACGG CGACATGGAG 
2S01 AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 

26 51 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 

27 01 TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAACTTG 
2751 CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 

2 801 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 
2851 TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC 
2901 CGGACCTCCA T C AG AG C CAG TCTCACCTTC AACCGCGGCT TCAAGGCTGG 
2951 GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGT CACA 
3001 GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGCACCAAC 

3 051 ATCTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT 
3101 GCAGCTCCCA TTTCATCAGC AAGTTTGGAA GAACCCCACA TTTTTCCTGC 

4" 3151 GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT GAAAGCCAAG 

rU 3201 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 

3251 CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 

3 3 01 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG 

33 51 ACGCAGCTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC 

34 01 CGCAGCCAAC CCGGCACTGC CCTCAGACTT C AAGAC CAT C CTGGACTGAT 
3451 GGCCACCCGC CCACAGCCAG GCCGAGAGCA GACAC CAGCA GCCCTGTCAC 
3501 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 
3551 ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 
3601 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 
3651 GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 
3701 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACCAGGAGC CCGGCTTCCA 
3 751 CTCCCCACAT AGGAATAGTC CATCCCCAGA TTCGCCATTG TTCACCCCTC 
3 801 GCCCTGCCCT CCTTTGCCTT CCACCCCCAC CAT C C AGGTG GAGACCCTGA 
3 851 GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG 
3 9 01 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGT CAAATT 
3951 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT 
4001 TTGAAAAAAA AAAAAAAAAA AAAAAAAAA 
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GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCC 

! + + + + + + 60 

CGTCGCGACGCAGGACGACGCGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGG 

AALRPAAHVGS PGPGHPRDA 
QRCVLLRTWEALAPATPA. MP - 
SAASCCARGKPWPRPPPRCR- 

GCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCT 

61 + + + + + + 120 

CGCGCGAGGGGCGACGGCTCGGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGA 

ARSPLPSRALPAAQPLPRGA 
RAPRCRAVRSLLRSHYREVL- 
ALPAAEPCAPCCAATTARCC- 

GCCGCTGGCCACGTTCGTGCGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGG 

12 i + + + + + + 180 

CGGCGACCGGTGCAAGCACGCCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCC 

AAGHVRAAPGAPGLAAGAAR 
PLATFVRRLGPQGWRLVQRG- 
RWPRS CGAWGPRAGGWCSAG- 

GGACCCGGCGGCTTTCCGCGCGNTGGTGGCCCANTGCNTGGTGTGCGTGCCCTGGGANGN 

CCTGGGCCGCCGAAAGGCGCGCNACCACCGGGTNACGNACCACACGCACGGGACCCTNCN 

GPGGFPR?GGP??GVRALG? 
DPAAFRA?VA?C?VCVPW? ? 
TRRLSARWWP7AWCACPG? ? 

ANGGCNGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGTCCTGCCTGAANGANCTGGTGGC 

TNCCGNCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACAGGACGGACTTNCTNGACCACCG 

'AAPRRPLLPPGVLPE? ?GG 
??PPAAPSFRQVSCL??LVA 
G ?PPPPPPSARCPA*??WWP 

CCGAGTGCTGCANANGCTGTGCGANCGCGGCGCGAANAACGTGCTGGCCTTCGGCTTCGC 

GGCTCACGACGTNTNCGACACGCTNGCGCCGCGCTTNTTGCACGACCGGAAGCCGAAGCG 

PSAA?AVR?RRE?RAGLRLR 
RVL? ?LC?RGA?NVLAFGFA 
ECC? ?CA?AAR?TCWPSASR 

GCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTA 

CGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCGGAAGTGGTGGTCGCACGCGTCGAT 

AAGRGPRGPPRGLHHQRAQL 
LLDGARGGPPEAFTTSVRSY 
CWTGPAGAPPRPSPPACAAT 

CCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCG ^ 

GGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTCGCCCCGCACCCCCGACGACGACGC 

PAQHGDRRTAGERGVGAAAA 
LPNTVTDALRGSGAWGLLLR- 
CPTR* PTHCGGAGRGGCCCA- 
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PRGRRRAGSPAGTLR? ? CAG 
RVGDDVLVHLLARCA? F V L V - 
AWATTCWFTCWHAAR? LCWW- 

GGNTCCCAGCTGCGCCTACCANGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCNAC 

+ + + + + + 600 

CCNAGGGTCGACGCGGATGGTNCACACGCCCGGCGGCGACATGGTCGAGCCGCGACGNTG 

GSQLRIjP ?vraaavparrc ? 
?PSCAY?VCGPPLYQLGAAT- 
?PAAPT?CAGRRCTSSAL?L- 

TCAGGCCCGGCCCCCGCCACACGCTANTGGACCCGAANGCGTCTGGGATCCAACGGGCCT 

L + + + + + + 660 

AGTCCGGGCCGGGGGCGGTGTGCGATNACCTGGGCTTNCGCAGACCCTAGGTTGCCCGGA 

SGPAPATR?WTR?RLGSNGP 
QARPPPHA?GPE?VWDPTGL- 
RPGPRHTL?DP?ASGIQRAW- 

GGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCTGCCAGCCCCGGGTGCGAGGAG 

L + + + + + + 720 

CCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGACCCGACGGTCGGGGCCCACGCTCCTC 

GTIASGRPGSPWAASPGCEE 
EP*RQGGRGPPGLPAPGARR- 
NHSVREAGVPLGCQPRVRGG- 

GCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGC 

± + + + + + + 78C 

CGCGCCCCCGTCACGGTCGGCTTCAGACGGCAACGGGTTCTCCGGGTCCGCACCGCGACG 

ARGQCQP KSAVAQEAQAWRC 
RGGSASRSLPLPKRPRRGAA- 
AGAVPAEVCRC PRG PGVAL P - 

CCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCC 

x + + + + + + 84C 

GGGACTCGGCCTCGCCTGCGGGCAACCCGTCCCCAGGACCCGGGTGGGCCCGTCCTGCGG 

P*AGADARWAGVLGPPGQDA 
PEPERTPVGQGSWAHPGRTP - 
LSRSGRPLGRGPGPTRAGRL- 

TGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCAC 

]_ + + + + + + 90( 

ACCTGGCTCACTGGCACCAAAGACACACCACAGTGGACGGTCTGGGCGGCTTCTTCGGTG 

WTE* PWFLCGVTCQTRRRSH 
GPSDRGFCVVSPARPAEEAT - 
DRVTVVSVWCHLPDPPKKPP- 

CTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCA 

a + + + + + + 96' 

GAGAAACCTCCCACGCGAGAGACCGTGCGCGGTGAGGGTGGGTAGGCACCCGGCGGTCGT 

LFGGCALWHAPLPPIRGPPA 
SLEGALSGTRHSHPSVGRQH- 
LWRVRS LARAT PTHPWAAS T- 

CCACGCGGGCCCCCCATCCACATCGCGGCCACCACGTCCTGGGACACGCCTTGTCCCCCG 
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PRGPPIHIAATTSWDTPCPP 
HAGPPSTSRPPRPGTRLVPR- 
TRAPHPHRGHHVLGHALS P G - 

GTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGNACACTGCGNCCCTC 

10 2l + + + + + + 1080 

CACATGCGGCTCTGGTTCGTGAAGGAGATGAGGAGTCCGCTGTTCNTGTGACGCNGGGAG 

VYAETKHFLYSSGDK?TA?L 
CTPRPSTSSTPQAT7TLRPS- 
VRRDQALPLLLRRQ7HC? P P - 

CTTCCTACTCAATATATCTGAGGCCCAGCCTGACTGGCGTTCGGGAGGTTCGTGGAGACA 

' GAAGGATGAGTTATATAGACTCCGGGTCGGACTGACCGCAAGCCCTCCAAGCACCTCTGT 

LPTQYI*GPA*LAFGRFVET 
FLLNISEAQPDWRSGGSWR? - 
SYSIYLRPSLTGVREVRGD?- 

NTCTTTCTGGTTCCAGGCCTTGGATGCCAGGATTCCCCGCAGGTTGCCCCGCCTGCCCCA 

114 1 + + + + + + 1200 

NAGAAAGACCAAGGTCCGGAACCTACGGTCCTAAGGGGCGTCCAACGGGGCGGACGGGGT 

?FLVPGLGCQDSPQVAPPAP 
SFWFQALDARIPRRLPRLPQ" 
LSGSRPWMPGFPAGCPACPS- 

GCGNTACTGGCAAATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCC 

+ + + + + + - 

CGCNATGACCGTTTACGCCGGGGACAAAGACCTCGACGAACCCTTGGTGCGCGTCACGGG 

APLANAAPVSGAAWEPRAVP 
RYWQMRPLFLELLGNHAQCP - 
? TGKCGP CFWS CLGTTRSAP 

CTACGGGGTGTTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGG 

+ + + + + + ] 

GATGCCCCACAAGGAGTTCTGCGTGACGGGCGACGCTCGACGCCAGTGGGGTCGTCGGCC 

LRGVPQDALPAASCGHPSSR 
YGVFLKTHCPLRAAVTPAAG - 
TGCSSRRTARCELRSPQQPV- 

TGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGAACACAG 

1321 + + + + + + 1380 

ACAGACACGGGCCCTCTTCGGGGTCCCGAGACACCGCCGGGGGCTCCTCCTCCTTGTGTC 

CLCPGEAPGLCGGPRGGGTQ 
VCAREKPQGSVAAPEEEEHR- 
SVPGRSPRALWRPPRRRNTD- 

ACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCT 
1381 + + + + + + 1440 



1201 ■ 



1261 • 



1260 



■ 1320 



TPVAWCSCSASTAAPGRCTA 
PPSPGAAAPPAQQPLAGVRL- 
PRRLVQLLRQHS S PWQVYGF- 

TCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACG 

1441 + + + + + + 1500 

AGCACGCCCGGACGGACGCGGCCGACCACGGGGGTCCGGAGACCCCGAGGTCCGTGTTGC 
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SCGPACAGWCPQASGAPGTT 
RAGLPAPAGAPRPLGLQAQR- 
VRACLRRLVPPGLWGSRHNE- 

AACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCT 

150 i + GTGGTTCTTCAAGTAGAGGGA^ 156 ° 

NAASSGTPRSSSPWGSMPSS 
TPLPQEHQEVHLPGEACQAL- 
RRFLRNTKKFI SLGKHAKLS- 

CGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCC 

1561 + + + + + + 1620 

GCGACGTCCTCGACTGCACCTTCTACTCGCACGCCCTGACGCGAACCGACGCGTCCTCGG 

RCRS *RGR*ACGTALGCAGA 
AAGADVEDERAGLRLAAQEP - 
LQELTWKMSVRDCAWLRRSP- 

CAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGT 

1621 + + + + + + 1680 

GTCCCCAACCGACACAAGGCCGGCGTCTCGTGGCAGACGCACTCCTCTAGGACCGGTTCA 

QGLAVFRPQSTVCVRRSWPS 
RGWLCSGRRAPSA*GDPGQV- 
GVGCVPAAEHRLREEILAKF- 

TCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCA 

168 1 + + + + + + 1740 

AGGACGTGACCGACTACTCACACATGCAGCAGCTCGACGAGTCCAGAAAGAAAATACAGT 

SCTG**VCTSSSCSGLSFMS 
PALADECVRRRAAQVFLLCH - 
LHWLMSVYVVELLRS FFYVT- 

CGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGT 

1741 + + + + + + 1800 

GCCTCTGGTGCAAAGTTTTCTTGTCCGAGAAAAAGATGGCCTTCTCACAGACCTCGTTCA 

RRPRFKRTGSFSTGRVSGAS 
GDHVSKEQALFLPEECLEQV- 
ETTFQKNRLFFYRKSVWS KL- 

TGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAG 

180 i + + + + + + I860 

ACGTTTCGTAACCTTAGTCTGTCGTGAACTTCTCCCACGTCGACGCCCTCGACAGCCTTC 

CKALESDST*RGCSCGSCRK 
AKHWNQTALEEGAAAGAVGS - 
QSIGIRQHLKRVQLRELSEA- 

CAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCA 

1861 + + + + + + 1920 

GTCTCCAGTCCGTCGTAGCCCTTCGGTCCGGGCGGGACGACTGCAGGTCTGAGGCGAAGT 

QRSGSIGKPGPPC*RPDSAS 
RGQAASGSQARPADVQTPLH - 
EVRQHREARPALLTSRLRFI- 

TCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAA 

1921 + + + + + + 1980 

AGGGGTTCGGACTGCCCGACGCCGGCTAACACTTGTACCTGATGCAGCACCCTCGGTCTT 
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SPSLTGCGRL* TWTTSWEPE 
PQA* RAAADCEHGLRRGSQN 
P KPDGLRP I VNMDYVVGART- 

CGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCG ^ 

GCAAGGCGTCTCTTTTCTCCCGGCTCGCAGAGTGGAGCTCCCACTTCCGTGACAAGTCGC 

RSAEKRGPSVSPRG*RHCSA 
VPQRKEGRASHLEGEGTVQR- 
FRREKRAERLTSRVKALFSV- 

TGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGG 

2041 + + + + + + 2100 

ACGAGTTGATGCTCGCCCGCGCCGCGGGGCCGGAGGACCCGCGGAGACACGACCCGGACC 

CSTTSGRGAPASWAPLCWAW 
AQLRAGAAPRPPGRLCAGPG- 
LNYERARRPGLLGASVLGLD- 

ACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGC 

2101 + + + + + + 2160 

TGCTATAGGTGTCCCGGACCGCGTGGAAGCACGACGCACACGCCCGGGTCCTGGGCGGCG 

TISTGPGAPSCCVCGPRTRR 
RYPQGLAHLRAACAGPGPAA- 
DIHRAWRTFVLRVRAQDPPP- 

CTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACA 

2161 + + + + + + 22 20 

GACTCGACATGAAACAGTTCCACCTACACTGCCCGCGCATGCTGTGGTAGGGGGTCCTGT 

LSCTLSRWM*RARTTPSPRT 
*AVLCQGGCDGRVRHHPPGQ 
ELYFVKVDVTGAYDT I PQDR- 



2221 



GGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGT 

CCGAGTGCCTCCAGTAGCGGTCGTAGTAGTTTGGGGTCTTGTGCATGACGCACGCAGCCA 

GSRRSSPASSNPRTRTACVG 
AHGGHRQHHQTPEHVLRASV 
LTEVIAS I I KPQNTYCVRRY- 



2280 



MPWSRRPPMGTSARPSRATS 
CRGPEGRPWARPQGLQEPRL- 
AVVQKAAHGHVRKAFKSHVS- 

CTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGANAACA 

2341 + + + + + + 2400 

GATGGAACTGTCTGGAGGTCGGCATGTACGCTGTCAAGCACCGAGTGGACGTCCTNTTGT 

LP*QTSSRTCDSSWLTCR?T 
YLDRPPAVHATVRGSPAG?Q- 
TLTDLQPYMRQFVAHLQ ?NS- 

GCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTG 

2401 + + + + + + 2460 

CGGGCGACTCCCTACGGCAGCAGTAGCTCGTCTCGAGGAGGGACTTACTCCGGTCGTCAC 
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AR*GMPSSSSRAPP*MRPAV 
PAEGCRRHRAELLPE*GQQW- 
PLRDAVVI EQS S SLNEAS SG- 

GCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGT 

2461 + + + + + + 2520 

CGGAGAAGCTGCAGAAGGATGCGAAGTACACGGTGGTGCGGCACGCGTAGTCCCCGTTCA 

ASSTSSYASCATTPCASGAS 
PLRRLPTLHVPPRRAHQGQV- 
LFDVFLRFMCHHAVRIRGKS- 

CCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCC 

2521 + + + + + + 2580 

GGATGCAGGTCACGGTCCCCTAGGGCGTCCCGAGGTAGGAGAGGTGCGACGAGACGTCGG 

PTSSARGSRRAPSSPRCSAA 
LRPVPGDPAGLHPLHAALQP - 
YVQCQGIPQGSILSTLLCSL- 

TGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCC 
2581 + + + + + + 2640 

CATATWRTSCLRGFGGTGCS 
VLRRHGEQAVCGDSAGRAAP - 
CYGDMENKLFAGIRRDGLLL- 

2641 + + + + + + 2700 

ACGCAAACCACCTACTAAAGAACAACCACTGTGGAGTGGAGTGGGTGCGCTTTTGGAAGG 

CVWWMISCW*HLTSPTRKPS 
AFGG* FLVGDTSPHPRENLP 
RLVDDFLLVTPHLTHAKTFL- 

TCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAG 

2701 + + + + + + 2760 

AGTCCTGGGACCAGGCTCCACAGGGACTCATACCGACGCACCACTTGAACGCCTTCTGTC 

SGPWSEVSLSMAAW*TCGRQ 
QDPGPRCP*VWLRGELAEDS - 
RTLVRGVPEYGCVVNLRKTV- 

TGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGG 

2761 + + + + + + 2820 

ACCACTTG^GGGACATCTTCTGCTCCGGGACCCACCGTGCCGAAAACAAGTCTACGGCC 

W*TSL*KTRPWVARLLFRCR 
GELPCRRRGPGWHGFCSDAG- 
VNFPVEDEALGGTAFVQMPA- 

2821 ■ 

GGGTGCCGGATAAGGGGACCACGCCGGACGACGACCTATGGGCCTGGGACCTCCACGTCT 

PTAYSPGAACCWIPGPWRCR 
PRP I PLVRPAAGYPD PGGAE 
HGLFPWCGLLLDTRTLEVQS- 

GCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCT 

2881 + + + + + + 2940 

CGCTGATGAGGTCGATACGGGCCTGGAGGTAGTCTCGGTCAGAGTGGAAGTTGGCGCCGA 
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ATTPAMPGPPSEPVSPSTAA 
RLLQLCPDLHQSQSHLQPRL- 
DYSSYARTSIRASLTFNRGF- 



' AGTTCCGACCCTCCTTGTACGCAGCGTTTGAGAAACCCCAGAACGCCGACTTCACAGTGT 



SRLGGTCVANSLGSCG 
Q G W E E H A S Q T L_ W_ G L_ A_ A^ 



S V T 
V S Q 

K ~A "G ~R N ~M " R R ~K L F G V L R L K C H S 



GCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCTACAAGA 

+ + + + + + 3oe 

' CGGACAAAGACCTAAACGTCCACTTGTCGGAGGTCTGCCACACGTGGTTGTAGATGTTCT 

ACFWICR*TASRRCAPTSTR 
PVSGFAGEQPPDGVHQHLQD- 
LFLDLQVNSLQTVCTNIYKI- 

TCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGC 

L + + + + + + 3i: 

AGGAGGACGACGTCCGCATGTCCAAAGTGCGTACACACGACGTCGAGGGTAAAGTAGTCG 

SSCCRRTGFTHVCCSSHFIS 
PPAAGVQVSRMCAAAPISSA- 
LLLQAYRFHACVLQLPFHQQ- 

AAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCT 

L + + + + + + 311 

TTCAAACCTTCTTGGGGTGTAAAAAGGACGCGCAGTAGAGACTGTGCCGGAGGGAGACGA 

KFGRTPHFSCASSLTRPPSA 
SLEEPHIFPARHL*HGLPLL- 
VWKNPTFFLRVISDTASLCY- 

ACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCC ^ ^ 

1 TGAGGTAGGACTTTCGGTTCTTGCGTCCCTACAGCGACCCCCGGTTCCCGCGGCGGCCGG 

TPS * KPRTQGCRWGPRAP PA 
LHPESQERRDVAGGQGRRRP - 
S ILKAKNAGMSLGAKGAAGP- 

CTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGCTGACTC 

! + + + + + + 33 

GAGACGGGAGGCTCCGGCACGTCACCGACACGGTGGTTCGTAAGGACGAGTTCGACTGAG 

LCPPRPCSGCATKHSCSS*L 
SALRGRAVAVPPSIPAQADS- 
LPSEAVQWLCHQAFLLKLTR- 



CTGTGGCACAGTGGATGCACGGTGAGGACCCCAGTGAGTCCTGTCGGGTCTGCGTCGACT 

DTVSPTCHSWGHSGQPRRS* 
TPCHLRATPGVTQDSPDAAE- 
HRVTYVPLLGSLRTAQTQLS- 

GTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGGCACTGC _ 

CAGCCTTCGAGGGCCCCTGCTGCGACTGACGGGACCTCCGGCGTCGGTTGGGCCGTGACG 
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VGSSRGRR*LPWRPQPTRHC 
SEAPGDDADCPGGRSQPGTA- 
RKLPGTTLTALEAAANPALP- 

CCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCGAGAGCA 

3421 + + + + + + 3480 

GGAGTCTGAAGTTCTGGTAGGACCTGACTACCGGTGGGCGGGTGTCGGTCCGGCTCTCGT 

PQTSRPSWTDGHPPTARPRA 
LRLQDHPGLMATRPQPGREQ- 
SDFKTILD*WPPAHSQAESR- 

GACACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACAC 
3481 _ + + - + + + + 3540 

DTSSPVTPGSTSQGGRGGPH 
TPAALSRRALRPREGGAAHT - 
HQQPCHAGLYVPGREGRPTP- 

CCAGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGT 

3541 + + + + + + 3600 

GGTCCGGGCGTGGCGACCCTCAGACTCCGGACTCACTCACAAACCGGCTCCGGACGTACA 

PGPHRWESEA*VSVWPRPAC 
QARTAGSLRPE*VFGRGLHV- 
RPAPLGV*GLSECLAEACMS- 

CCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTG 

3601 + + + + + + 3660 

GGCCX3ACTTCCGACTCACAGGCCGACTCCGGACTCGCTCACAGGTCGGTTCCCGACTCAC 

PAEG*VSG*GLSECPAKG*V 
RLKAECPAEA*ASVQPRAEC- 
G*RLSVRLRPERVSSQGLSV- 

TCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCC 

3661 + + + + + + 3720 

AGGTCGTGTGGACGGCAGAAGTGAAGGGGTGTCCGACCGCGAGCCGAGGTGGGGTCCCGG 

SSTPAVFTSPQAGARLHPRA 
PAHLPSSLPHRLALGSTPGP - 
QHTCRLHFPTGWRSAPPQGQ- 

AGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGA 
3721 + + + + + + 3780 

SFSSPGARLPLPT*E*SIPR 
AFPHQEPGFHSPHRNSPSPD- 
LFLTRSPASTPHIGIVHPQI- 

TTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATCCAGGTG 

3781 + + + + + + 3840 

AAGCGGTAACAAGTGGGGAGCGGGACGGGAGGAAACGGAAGGTGGGGGTGGTAGGTCCAC 

FAIVHPSPCPPLPSTPTIQV 
SPLFTPRPALLCLPPPPSRW- 
RHCS PLALPS FAFHPHH PGG- 

GAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTG 

3841 + + + + + + 3900 

CTCTGGGACTCTTCCTGGGACCCTCGAGACCCTTAAACCTCACTGGTTTCCACACGGGAC 
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a ETLRRTLGALGIWSDQRCAL 
b RP * EGPWELWE FGVTKGVPC 

C DPEKDPGSSGNLE*PKVCPV- 

TACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGT 

U 3901 + + + + + + 3960 

[ m ATGTGTCCGCTCCTGGGACGTGGACCTACCCCCAGGGACACCCAGTTTAACCCCCCTCCA 

CI a YTGEDPAPGWGSLWVKLGGG 

■ % b TQARTLHLDGGPCGSNWGEV- 

X; c HRRGP CTWMGVPVGQ I GGR C - 

fl I GCTGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAA 

]™ 3961 + + + + + + 4020 

*f CGACACCCTCATTTTATGACTTATATACTCAAAAAGTCAAAACTTTTTTTTTTTTTTTTT 



o 
m 



AVGVKY* IYEFFSFEKKKKK 
LWE*NTEYMSFSVLKKKKKK- 
CGSKILNI *VFQF* KKKKKK- 

AAAAAAAAA 

4021 4029 

TTTTTTTTT 

K K K 
K K 
K K 
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met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 
10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

qly pro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 60 
phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arq pro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 9° 
leu lys glu leu val ala arg val leu gin arg leu cys glu arg 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 

130 

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 

140 150 
trp gly leu leu leu arg arg val gly asp asp val leu val his 
TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC 

160 

leu leu ala arg cys ala leu phe val leu val ala pro ser cys 
CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC 

170 180 
ala tyr gin val cys gly pro pro leu tyr gin leu gly ala ala 
GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC 

190 

thr gin ala arg pro pro pro his ala ser gly pro arg arg arg 
ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 
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200 210 

leu gly cys glu arg ala trp asn his ser val arg glu ala gly 

CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG GCC GGG 
220 

val pro leu gly leu pro ala pro gly ala arg arg arg gly gly 
GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG GGC 

230 240 
ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly 
AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC 

250 

ala ala pro glu pro glu arg thr pro val gly gin gly ser trp 
GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG 

260 270 
ala his pro gly arg thr arg gly pro ser asp arg gly phe cys 
GCC CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT 

280 

val val ser pro ala arg pro ala glu glu ala thr ser leu glu 

GTG GTG TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG 

290 3°° 

gly ala leu ser gly thr arg his ser his pro ser val gly arg 

GGT GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC 
310 

gin his his ala gly pro pro ser thr ser arg pro pro arg pro 
CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC 

320 330 
trp asp thr pro cys pro pro val tyr ala glu thr lys has phe 
TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC 

340 

leu tyr ser ser gly asp lys glu gin leu arg pro ser phe leu 
CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA 

350 360 
leu ser ser leu arg pro ser leu thr gly ala arg arg leu val 
CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC GCT CGG AGG CTC GTG 

370 

glu thr ile phe leu gly ser arg pro trp met pro gly thr pro 
GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC 

380 390 
arg arg leu pro arg leu pro gin arg tyr trp gin met arg pro 
CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC 

400 

leu phe leu glu leu leu gly asn his ala gin cys pro tyr gly 
CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG 

410 420 
val leu leu lys thr his cys pro leu arg ala ala val thr pro 
GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA 
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430 

ala ala gly val cys ala arg glu lys pro gin gly ser val ala 
GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC TCT GTG GCG 

440 450 
ala pro glu glu glu asp thr asp pro arg arg leu val gin leu 
GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG CAG CTG 

460 

leu arg gin his ser ser pro trp gin val tyr gly phe val arg 
CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG CGG 

470 480 
ala cys leu arg arg leu val pro pro gly leu trp gly ser arg 
GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 

490 

his asn glu arg arg phe leu arg asn thr lys lys phe ile ser 
CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC 

500 510 
leu gly lys his ala lys leu ser leu gin glu leu thr trp lys 
CTG GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG 

520 

met ser val arg asp cys ala trp leu arg arg ser pro gly val 
ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT 

530 540 
gly cys val pro ala ala glu his arg leu arg glu glu ile leu 
GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG 

550 

ala lys phe leu his trp leu met ser val tyr val val glu leu 
GCC AAG TTC CTG CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG 

560 570 
leu arg ser phe phe tyr val thr glu thr thr phe gin lys asn 
CTC AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC 

580 

arg leu phe phe tyr arg lys ser val trp ser lys leu gin ser 

AGG CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC 

590 600 
ile gly ile arg gin his leu lys arg val gin leu arg glu leu 
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG 
610 

ser glu ala glu val arg gin his arg glu ala arg pro ala leu 
TCG GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG 

620 S30 
leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg 
CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG 

640 

pro ile val asn met asp tyr val val gly ala arg thr phe arg 
CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC 
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650 660 

arg glu lys arg ala glu arg leu thr ser arg val lys ala leu 

AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu 
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 

680 690 
gly ala ser val leu gly leu asp asp ile his arg ala trp arg 

GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 
700 

thr phe val leu arg val arg ala gin asp pro pro pro glu leu 
ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG 

710 720 
tyr phe val lys val asp val thr gly ala tyr asp thr ile pro 
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 

730 

gin asp arg leu thr glu val ile ala ser ile ile lys pro gin 
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG 

740 750 
asn thr tyr cys val arg arg tyr ala val val gin lys ala ala 
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 

760 

his gly his val arg lys ala phe lys ser his val ser thr leu 
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG 

770 780 
thr asp leu gin pro tyr met arg gin phe val ala his leu gin 
ACA GAC CTC CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG 

790 

glu thr ser pro leu arg asp ala val val ile glu gin ser ser 
GAG ACC AGC CCG CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC 

800 810 
ser leu asn glu ala ser ser gly leu phe asp val phe leu arg 
TCC CTG AAT GAG GCC AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC 

820 

phe met cys his his ala val arg ile arg gly lys ser tyr val 
TTC ATG TGC CAC CAC GCC GTG CGC ATC AGG GGC AAG TCC TAC GTC 

830 840 
gin cys gin gly ile pro gin gly ser ile leu ser thr leu leu 
CAG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC 

850 

cys ser leu cys tyr gly asp met glu asn lys leu phe ala gly 
TGC AGC CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG 

860 870 
ile arg arg asp gly leu leu leu arg leu val asp asp phe leu 
ATT CGG CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG 
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880 

leu val thr pro his leu thr his ala lys thr phe leu arg thr 
TTG GTG ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC 

890 900 
leu val arg gly val pro glu tyr gly cys val val asn leu arg 
CTG GTC CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG 

910 

lys thr val val asn phe pro val glu asp glu ala leu gly gly 
AAG ACA GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC 

920 930 

thr ala phe val gin met pro ala his gly leu phe pro trp cys 

ACG GCT TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC 
940 

gly leu leu leu asp thr arg thr leu glu val gin ser asp tyr 

GGC CTG CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC 

950 960 
ser ser tyr ala arg thr ser ile arg ala ser leu thr phe asn 
TCC AGC TAT GCC CGG ACC TCC ATC AGA GCC AGT CTC ACC TTC AAC 

970 

arg gly phe lys ala gly arg asn met arg arg lys leu phe gly 
CGC GGC TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG 

980 990 
val leu arg leu lys cys his ser leu phe leu asp leu gin val 
GTC TTG CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG 

1000 

asn ser leu gin thr val cys thr asn ile tyr lys ile leu leu 
AAC AGC CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG 

1010 1020 
leu gin ala tyr arg phe his ala cys val leu gin leu pro phe 
CTG CAG GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT 

1030 

his gin gin val trp lys asn pro thr phe phe leu arg val ile 
CAT CAG CAA GTT TGG AAG AAC CCC ACA TTT TTC CTG CGC GTC ATC 

1040 1050 
ser asp thr ala ser leu cys tyr ser ile leu lys ala lys asn 
TCT GAC ACG GCC TCC CTC TGC TAC TCC ATC CTG AAA GCC AAG AAC 

1060 

ala gly met ser leu gly ala lys gly ala ala gly pro leu pro 
GCA GGG ATG TCG CTG GGG GCC AAG GGC GCC GCC GGC CCT CTG CCC 

1070 1080 
ser glu ala val gin trp leu cys his gin ala phe leu leu lys 
TCC GAG GCC GTG CAG TGG CTG TGC CAC CAA GCA TTC CTG CTC AAG 

1090 

leu thr arg his arg val thr tyr val pro leu leu gly ser leu 
CTG ACT CGA CAC CGT GTC ACC TAC GTG CCA CTC CTG GGG TCA CTC 
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1120 




1130 1132 
phe lys thr ile leu asp OP 

TTC AAG ACC ATC CTG GAC TGA TGG C C AC C C G C C C AC AGC C AGGC C GAGAGC AGA 



CACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 
AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 
GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 
CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 
CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 
CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 
GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA 
CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC 
TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGRAAAAAAAAAAAAAAAAAA 
AAAAAAAAAA 
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